1 Summary


The growing knowledge in genetics, epigenomics, epitranscriptomics and
the 3D - or even 4D - genome structure provides an increasing number of
detection targets that can be used to identify species or genetic lines, whether
modified or not. Biotic and abiotic stresses also induce numerous unintentional
genetic, epigenetic and epitranscriptomic modifications. Those changes are
transmissible and can be ordered in regions and classified. The detection target
is characterised by the mutagenesis technique used. For instance, the detection
of transgenic GMOs or SDN3 modifications of new breeding techniques (NBTs)
will target the junctions of fragment insertions into the genome. Each insertion
induces epigenetic, and probably epitranscriptomic, changes which can also
be targeted. In addition, one group of markers is linked to the trait(s) introduced
or modified by the breeder, whose sequence could be used in quantification
and “screening”. The other target will be a subset of the elements of a matrix
approach (as described elsewhere in this book). General selection markers,
such as those used for plant breeding, together with mutagenesis
technique-specific markers, could differentiate genetically modified organisms
(GMOs) of any origin. They can be used to quantify and certify, through a global
approach to the organism, that the trait modification is artefactual and not
“natural”. The growing mastery of single-cell sequencing techniques should soon
make it possible to differentiate the modifications due, for example, to each step
of a CRISPR-Cas transformation of cells in culture. This chapter will focus
mainly on detection targets based on nucleic acids, DNA and RNA, modified
or unmodified, for their routine use in private and enforcement detection
laboratories to comply with food labelling and European traceability rules.
The chapter uses the wording “hidden” GMOs and “new” GMOs as defined by
the French non-governmental organisations (NGOs) and farmers' union at the
origin of the 2018 European Court of Justice ruling.


1.1 Introduction 


Over 10 000 years, traditional breeding has resulted in the domestication
syndrome, which is a genetic erosion over time that has caused a drastic reduction
in the variability available for breeding in recent decades (Ahmar et al., 2020;
Meyer et al., 2012). Genetic diversity is nevertheless necessary for the creation
of new varieties adapted to the changing needs of agricultural production:
adaptation to the pedoclimatic conditions of new production regions, resistance
to constantly evolving parasites and pathogens, increase in yields, or finally, the
production or the suppression of the production of molecules sought after or
avoided by consumers or industry, all despite a reduced resilience of crops
(Ceccarelli and Grando, 2020; Gepts, 2006; Louwaars, 2018; Zampieri et al.,
2020a,b).

The use of simple and generally single adapted traits is carried out by
(i) time-consuming crossings between more or less divergent populations of
the domestic species, (ii) the introgression of traits from more or less related
wild species, (iii) the in vivo, and then in vitro, random generation of mutations
and epimutations and finally (iv) the introduction of sequences from other
species, from transgenesis to various techniques for modifying genomes and
epigenomes (Glenn et al., 2017; Li et al., 2016; Miki and McHugh, 2004; Olsen
and Wendel, 2013a; Van Tassel et al., 2020; Xue et al., 2019; Witcombe et al.,
2013). Transgenesis techniques were well received in the '90s because they
offered strong patent protection for the transgenic lines created, together
with a relatively rapid return on investment for the simple introgressed
traits. Transgenesis appeared to be an excellent economical alternative where
companies could commercialise new features, while the unpredictability and
the lack of means for selecting complex features exposed the bottleneck of
costly screenings of populations obtained from in vitro random mutagenesis
(Birch, 1997). Those issues of random mutagenesis were only solved with the
development after 2000 of the TILLING method (Targeting Induced Local
Lesions in Genomes), the implementation of massive screening platforms and
the support of marker-assisted selection methods (McCallum et al., 2000a,b).

After a few failures, such as the market withdrawal of delayed-ripening tomatoes
and frost-resistant strawberries, the first soybean crops from transgenesis
arrived in European ports in 1996. GMOs intended for food and feed uses
have met a very mixed reception in countries, particularly European ones,
with a strong culinary tradition and productions under official quality signs
(Frewer et al., 2013). As a result, their marketing in Europe is subject to prior
authorisation, with a traceability requirement to ensure that consumers can
exercise their freedom of choice and to separate food GMOs from drug or
industrial GMOs and, finally, to post-marketing environmental monitoring for
those authorised for cultivation (dissemination). Genetically modified plants are
generally considered a sandbox of technologies that mobilise huge economic
means and interests before applications to animals. They are characterised by
a constant battle between retailers and seed companies to determine who
effectively controls the food-processing supply chains, and require a labelling
and traceability system, as is the standard for food in the EU (Bertheau, 2013a;
Bonfini et al., 2001; Corrado, 2016; Dwinger et al., 2007; Davison and Bertheau,
2008). Additionally, several stakeholders question the definition of GMOs
in the face of new techniques for modifying genomes and epigenomes. In the
meantime, citizens/consumers make a clear distinction, through personal
risk/benefit analyses, between GMOs intended for food and those intended for
therapeutic applications.

GMOs are relative newcomers to food supply chains and are generally
indistinguishable from homologs familiar to consumers. Thus, as previously seen,
the various supply chains have developed reliable tools to identify their presence
(industrial GMOs) and their (relative) absence in conventional products.

The controversy about transgenic GMOs has continued with the arrival
of “new techniques” of genome and epigenome modification, the NBTs, that
could lead to new products whose status would, this time, differ entirely from
one world region to another. The controversy has also resurfaced with the recent
rulings of the European Court of Justice (ECJ) and the French Conseil d'État
(FCE), which recalled that, under the terms of Directive 2001/18, all products
derived from in vitro techniques that had not been proven safe by a long history
of use in 2001 were non-exempted GMOs. Besides the GMOs exempted from
the application of Directive 2001/18, we thus now have to distinguish the
transgenic GMOs, the “hidden” GMOs (due to in vitro random mutagenesis) and
the “new” GMOs developed with NBTs.

The basic idea of transgenic, “hidden” and “new” GMOs is that life is a kind of
large Meccano with interchangeable parts, a view reinforced by the large and
rough syntenies observed between species. In this overall reductionist vision,
the transfer of genetic or epigenetic information should be easy because of
a simple correlation between genotype and phenotype. A trait would thus
be easily transposable and reusable. Such a trait expressed in one gene pool
would therefore be transferable, without difficulty, to another gene pool.

Indeed, the controversy reflects a Meccano understanding of the genome
based on the molecular biology of the '70s and a risk-taking position, depending
on whether or not unintentional modifications are taken into account as having
impacts in unknown environmental circumstances (Deng et al., 2008; Ladics
et al., 2015; Odorico et al., 2018; Weckwerth et al., 2004). For instance, this
issue of considering or ignoring unintended changes raises the question of
whether we should be concerned that a simple point mutation, a Single-Nucleotide
Variant (SNV), can influence gene expression by distally altering the proximity of
regions of different chromosomes, given our current understanding of genome
structure and regulation based on Topologically Associated Domains (TADs)
(Finn and Misteli, 2019). Similarly, are silent mutations that change proteins' 3D
structure and enzymatic activities relevant to risk assessment (Anonymous,
2006; Chamary and Hurst, 2009; Kimchi-Sarfaty et al., 2007)?

In this simplified view of genomes and epigenomes, the genomic data
should have allowed the transfer of genes from one organism to another, in the
hope that the syntenies observed between species would allow easy transposition
of the corresponding sequences' behaviour. However, a substantial complexity
had to be faced: the definition of genes is continuously evolving, and greater
understanding has shown that genomes have a three-dimensional regulatory
organisation with unexpected overlapping genes (Cattoni et al., 2017, 2015).
Therefore, understanding the genotype-phenotype relationship currently
remains a distant holy grail (Bennetzen et al., 2004; Gerstein et al., 2007;
Jakobson et al., 2019; Portin and Wilkins, 2017; Scherrer and Jost, 2007; Yaish,
2017; Zhao et al., 2019). Regulation systems are far more complex than was
expected in the 1970s (Gerstein et al., 2007, 2012; Hinman and Cary, 2017). An
“encoding” sequence can give rise to several different proteins by, for example,
alternative splicing into cellular networks, by exon skipping or shuffling, intron
retention, etc. that vary according to the cellular environment (Baralle and
Giudice, 2017; Chaudhary et al., 2019; Clark et al., 2019; Dvinge, 2018; Park
et al., 2018). Moonlighting proteins are proteins whose functions may differ
according to the phase of the cell cycle (Chen et al., 2018; Jeffery, 2015; Singh
and Bhalla, 2020). Finally, biotic environmental factors, such as the microbiota,
for instance, are known to interfere differentially with the plant genotype,
some being pathogens in vitro but not in vivo (Li et al., 2018; Quambusch and
Winkelmann, 2018; Quiroga et al., 2017). All these factors are currently not
considered in risk assessments and will not be for the new GMOs.

These numerous technical and biological bottlenecks affect the ability to
rapidly and accurately transfer genes, traits and regulations from one species to
another or even among cultivars of the same species. Despite this reality, a strong
economy of promises has been developed around the NBT-derived products
due to the enormous economic impact for seed companies, marketing patented
seeds each year in a race towards some technical innovation (Zhang et al., 2016).
This chapter will examine whether the current frame of GMO detection, presented
in the previous chapter, fits the purpose of routine detection, identification and
quantification of “hidden” and “new” (NBT-derived) GMOs by (i) identifying the
sources and targets of detection, (ii) proposing a routine detection scheme and
(iii) pointing to the technique suspected to be at the origin of the GMO.


2 Background to “hidden” and “new” GMOs


Transgenic, “hidden” and “new” GMOs, the latter derived from NBTs
and more particularly gene editing, have in common several in vitro steps
whose importance for the GMO status has recently been recalled by the ECJ
and FCE. We will mainly focus on nuclear changes, although numerous unintended
modifications also occur in organelles.


2.1 New breeding techniques


NBTs do not have a formal definition, as the scope of the term has varied since
its first release. This term appeared in 2007 following questioning of the
Commission by the Dutch COGEM¹ as to the GMO status or not of products issued
from various new techniques or from old ones renamed. In its first version, it
included the grafting of a non-GM scion on a GM rootstock, floral dip and
agroinfiltration, and various techniques of modification of genomes and
epigenomes by ODM², RNAi³, alteration by nucleases (meganucleases, ZFN, TALEN,
then CRISPR-Cas), null segregants, and finally synthetic biology with its multiple
definitions (Regalado, 2015b; Schaart and Visser, 2009). They thus covered genetic,
epigenetic and epitranscriptomic modifications, while only the genetic ones
are assessed in the GMO notifiers' dossiers. At least some of them have recently
been renamed “New/Novel Genomic Techniques” by the European Commission.
Without a long and proven safety record prior to 2001, most of them have been
recognised to be GMOs since the 2018 ruling by the ECJ.


2.2 Related techniques common to all the GMOs


“Related techniques” involve all the in vitro techniques used for genetic and
epigenetic modifications. These related techniques include (i) protoplastisation
to obtain isolated cells, (ii) in vitro culture, (iii) delivery of reagents (DNA, RNA, or
RNA-associated or non-RNA-associated proteins), (iv) the selection procedure
of modified cells, (v) the elimination of these same selection markers, for
example by the Cre-Lox recombinase system, (vi) the callus differentiation,
and (vii) the regeneration of seedlings before (viii) the habituation to culture
conditions. All the resulting mutations and epimutations follow a dynamic that
has not been studied in detail until now (Bertheau, 2019; Filipecki and Malepszy,
2006; Lee and Seo, 2018; Mendizabal et al., 2014). However, these old-fashioned
related techniques still constitute a bottleneck in large-scale technique
applications that have not been mastered, as recently noted (Ledford, 2016).
They are the source of the “somaclonal variation”, a sum of unmastered genetic,
epigenetic and epitranscriptomic changes.


1 GMO Risk Assessment Agency
2 Oligonucleotide directed Mutagenesis
3 Interfering RNA

Protoplastisation, obtained by degrading the pectocellulosic wall under
controlled osmotic conditions, is the first stressing step toward isolated cell
cultures and plant regeneration after genome and epigenome modifications
(Leva et al., 2012; Leva and Rinaldi, 2012). This protoplastisation and the
subsequent culture stages induce a series of stresses, leading to somatic cell
pluripotency by as yet unknown mechanisms (Bidabadi and Jain, 2020; Caboche,
2010; Feher, 2015). These culture stages induce point mutations and indels
together with more extensive changes such as chromosomal rearrangements,
aneuploidy, transposable element mobilisation, endoreplication,
chromothripsis, etc., which also cause sensu lato epimutations (Bednarek and
Orlowska, 2020; Filipecki and Malepszy, 2006; Fossi et al., 2019; Henry et al.,
2018; Lang and Schnittger, 2020).

All in vitro cultures of isolated cells, whether or not conducted for plant
regeneration, induce somaclonal variation (random variations of genomes and
epigenomes) usable in plant breeding (Bado et al., 2017; Bairu et al., 2011;
Bidabadi and Jain, 2020; Filipecki and Malepszy, 2006; Karp, 1995; Krishna
et al., 2016; Li, X.-Q., 2017; Miguel and Marum, 2011; Smulders and de Klerk,
2011; Stebbins, 1950). This variation corresponds to the cellular response to
exposure to different environmental stresses, with the “environment” starting
at the cellular cytoplasmic limit (Kucab et al., 2019; Perrone and Martinelli,
2020). It can be detected even after several generations at the genetic and
transcriptomic levels and maybe later at the epitranscriptomic level (Bairu et al.,
2011; Batista et al., 2017; Bregitzer et al., 1998; Flint-Garcia, 2013; Han et al.,
2018; Henikoff and Greally, 2016; Li, 2016; Martinez, 2018; Miyao et al., 2012;
Perrone and Martinelli, 2020). The transmitted stress memory can even lead
cells and their descendants to “choose” between proliferation and quiescence
(Filipecki and Malepszy, 2006; Kedziora and Purvis, 2017; Nathans et al., 2021;
Yang et al., 2017). Finally, transgenerational epigenetics plays a much more
critical role in evolution than initially expected (Quadrana and Colot, 2016;
Weigel and Colot, 2012; Yaish, 2017).

Each isolated cell can lead to a homogeneous or chimeric regenerated plant,
whatever the modification system used - RNA, DNA or RiboNucleoProtein (RNP)
(Liu, H.-J. et al., 2020; Meins Jr, 1983) - as is also observed for animals (Hennig
et al., 2020). An intercellular polymorphism develops because of the instability
of genomes during cell multiplication phases towards plant regeneration, an
observation similar to the chimaerism observed for monozygotic human twins
(Jonsson et al., 2021). This chimaerism, induced before plant regeneration,
remains unstudied in the characterisation of in vitro cultivated plants and
unreported in petitioners' dossiers for risk assessment. The induced callus
necessary for regeneration is more susceptible to mutations than a growing plant
(Altpeter et al., 2016; Bidabadi and Jain, 2020; Wang et al., 2019). Extreme
effects are also observable in the progeny, such as the chromothripsis observed
for potatoes regenerated from protoplasts (Fossi et al., 2019; Henry et al., 2018;
Koltsova et al., 2019). Indeed, chromothripsis micronuclei can be detectably
transmitted to the progeny and induced by Cas9 SDN⁴ (Cullot et al., 2019; Fossi
et al., 2019; Leibowitz et al., 2015, 2021).

Conversely, the culture conditions used for the micro-propagation of
certain plant species aim to minimise this somaclonal variation and
the induced chimaerism, which can, however, still be identified (Aravanopoulos,
2003; Martinez, 2018; Ruffoni and Savona, 2013; Teixeira da Silva et al., 2007;
Thorpe, 2013).

The delivery of reagents, whether DNA, RNA (messenger or not) or proteins
(such as RNA-associated nucleases for CRISPR-Cas RNPs⁵, contaminated by
foreign DNA of the multiplication organism despite purification steps), is
a highly empirical technical phase that induces genomic and epigenomic scars
(Bertheau, 2019; Khalil, 2020; Ledford, 2016).

Transgenesis and genome and epigenome editing methods share these
related techniques and their unintended effects. The scars left after the excision
of selection markers from modified cells are only one example (Ates et al., 2020;
Bertheau, 2019; Jansing et al., 2019; Manimaran et al., 2011; Marx, 2015;
Miguel and Marum, 2011; Ruffoni and Savona, 2013; Scahill et al., 2008; Volkova
et al., 2020; Yau and Stewart, 2013). Therefore, most authors consider it
necessary for NBTs to avoid all these in vitro culture phases, but the suggested
options have not proven themselves (Hamada et al., 2017; Maher et al., 2020).

Random changes in somaclonal variation have been known for a long time.
Still, little use was made of them after a fad phase, because of the difficulties in
screening for mutations, their unpredictability and the trouble of managing them
due to their instability (Evans, 1989). These variations became “easily” usable
after 2000, with molecular markers using the TILLING technique on high-throughput
and costly screening platforms, as previously seen. These identification tools
are adaptable for characterising a cultivar and thus a GMO or a class of GMOs
(Bhargava and Sharma, 2013; Korir et al., 2013; Kwon et al., 2005; Matsaunyane
and Dubery, 2018; Singh et al., 2019; Smith and Smith, 1992).


4 Sequence Directed Nuclease
5 RiboNucleoProtein

As long as one wants to take the trouble to identify the different resultant
scars, there is no technical barrier to assembling mutations and epimutations in
databases to differentiate organisms' spontaneous mutations, which are mostly
neutral, from those induced in vivo or in vitro (Orr et al., 2020; Shahryary et al.,
2020).


2.3 Genome stability and stresses 


An organism is generally characterised by the transmission of an integrative
genome acquired through a series of trial-and-error adaptations to an
environment (Balestrazzi et al., 2020; Nisa et al., 2019). The majority of
stress-induced mutations and epimutations are eliminated in different ways if the
modifications are not neutral or are unsuitable for the environment, leading
to the unviability of the organism (Daneva et al., 2016; Morrow, 1975). This
stability of genomes and epigenomes is one of the bases of the evolution
and differentiation of species. Thus, only a meagre number of mutations can
accumulate, as observed for the Napoleon oak, which is not an isolated case
when considering multi-millennial trees and natural clones (Kuhlemeier, 2017;
Ledford, 2017; Orr et al., 2020; Schmid-Siegert et al., 2017). The biotic and
abiotic stress-induced mutations and epimutations are transmissible and
distinguishable from the spontaneous ones (Ashapkin et al., 2016; Bednarek
and Orlowska, 2020; Hahn and Nekrasov, 2019; He and Li, 2018; Kellenberger
et al., 2018; Perrone and Martinelli, 2020; Quadrana and Colot, 2016; Rath, 2018;
Shahryary et al., 2020). The rates of mutations, conversions, recombinations,
indels and heritable epimutations can vary by several orders of magnitude due
to environmental stresses. These reactions to stresses can lead to hypermutation
situations (Dulieu, 2005; Filipecki and Malepszy, 2006; Perrone and Martinelli,
2020; Rath, 2018; Reddy et al., 2020; Roberts and Gordenin, 2014). Mutations
and induced hypermutations can thus persist as signatures of past events.
Indeed, among the hitherto unexploited factors of variability generated by
mutagenesis, we can cite (i) the differences in the genomes' localisation, (ii) their
predictability, (iii) the densities in the form of hotspots, (iv) their proximities,
(v) their locations in regions that are frequently transcribed or not, or non-coding,
and (vi) their participation or not in structural rearrangements, according to
(vii) plant lifetimes (Kumar et al., 2014; Renny-Byfield et al., 2017;
Rodgers-Melnick et al., 2015, 2016; Schmid-Siegert et al., 2017).
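
As an illustration of how the density factor in (iii) could be turned into a
measurable quantity, the following minimal sketch counts mutation positions in
fixed-size windows along one chromosome and reports the windows that reach a
chosen count. The function names, the window size and the threshold are
assumptions made for this example only; they are not taken from the cited
studies, and real hotspot calling would need a statistical background model.

```python
from collections import Counter
from typing import Iterable, List, Tuple


def window_counts(positions: Iterable[int], window: int) -> Counter:
    """Count mutation positions per fixed-size, non-overlapping window."""
    return Counter(pos // window for pos in positions)


def hotspots(positions: Iterable[int], window: int, min_count: int) -> List[Tuple[int, int, int]]:
    """Return (start, end, count) for windows holding at least min_count mutations."""
    counts = window_counts(positions, window)
    return sorted(
        (index * window, (index + 1) * window, n)
        for index, n in counts.items()
        if n >= min_count
    )


# Hypothetical positions (0-based) on a single chromosome.
example_positions = [10, 20, 35, 1200, 5000, 5010, 5020, 5030]
example_hotspots = hotspots(example_positions, window=1000, min_count=3)
```

With these toy positions, two windows pass the threshold, while the isolated
mutation at position 1200 does not. Comparing such density profiles between
lines is one conceivable way of ordering changes in regions.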

All heterologous intra- or extracellular DNA induces “immune” (and
inflammatory) responses in plant and animal cells. Cells can differentiate
extracellular DNA, particularly self from non-self, through its methylation
profiles, and can be remarkably inhibited by the DNA of phylogenetically close
plants (Monticolo et al., 2020). Intrusive and extracellular heterologous DNA and
RNA significantly impact, in a dose-dependent way, the cellular reactions and
genome and epigenome organisation (Bhat and Ryu, 2016; Monticolo et al., 2020).
This would be especially true when a foreign vector is used after it has gone
through in vitro phases that modified its self/non-self identification pattern. With
the biological mechanisms of resilience to stress, such as DNA damage repair,
being constitutive, it is evident that some genomic and epigenomic scars will be
shared or even appear as methods' signatures. The boundary between the two
kinds of unintentional modification will sometimes be arbitrary.

Nevertheless, the management of these data at the organism level will
always distinguish the “natural” from the “artefacts” affecting the genomes'
integrity (Bertheau, 2019; Biswas et al., 2020).


3 Sources of detection targets 


Mutations and epimutations are sometimes detectable by phenotyping, but
most often by ultra-deep sequencing and coverage and recently by Southern
hybridisation (Holst-Jensen et al., 2016; Kishikawa et al., 2019; Mardis, 2013;
Salk et al., 2018; Sims et al., 2014; Takabatake et al., 2020a). Reliable detection
needs to use the nearest genomes and epigenomes as comparators,
thus sequencing, at some cost, the original unmodified variety instead of
comparing to a consensus genome in databases (Germini et al., 2018). The
ability to detect those unintended changes also relies on representative and
dependable genomes and epigenomes, distinguishing the “core genome”
and pangenomes (Hurgobin and Edwards, 2017). Even these reference
genomes remain difficult to constitute due to the numerous defects of
sequencing techniques, such as the preparation of sequencing libraries, the
somatic mutations and chromosomal rearrangements, not to mention the PCR
amplification errors and significant imperfections of software in assembling
and comparing sequences (Bertheau, 2019; Bolukbasi et al., 2016; Ekblom
and Wolf, 2014; Newman et al., 2020; Ravindran, 2018; Robin et al., 2016;
Wienert et al., 2019). Building up genomes of quality is rare, even for human
ones (Ballouz et al., 2019; Bertheau, 2019; Hart et al., 2014; Michael and
VanBuren, 2020; Tello-Ruiz et al., 2021). Accordingly, claims about the absence
of unintended changes in genomes and epigenomes have to be examined
with a very critical mind.
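
The comparator principle described above can be sketched as a simple set
difference over variant calls. This is only a toy illustration under stated
assumptions: the Variant record, the function name and the example calls are
hypothetical, both call sets are assumed to come from the same pipeline and the
same reference coordinates, and sequencing or assembly errors are ignored.

```python
from typing import Iterable, List, NamedTuple


class Variant(NamedTuple):
    chrom: str
    pos: int   # 1-based position on the shared reference
    ref: str
    alt: str


def candidate_changes(sample: Iterable[Variant], comparator: Iterable[Variant]) -> List[Variant]:
    """Return variants present in the sample but absent from the comparator."""
    return sorted(set(sample) - set(comparator))


# Hypothetical toy calls, both made against the same consensus reference.
parent_calls = [Variant("chr1", 100, "A", "G"), Variant("chr2", 50, "C", "T")]
line_calls = parent_calls + [Variant("chr3", 7, "G", "GA")]

# Against the consensus alone, every call looks like a difference.
versus_consensus = candidate_changes(line_calls, [])
# Against the unmodified parental variety, only the line-specific call remains.
versus_parent = candidate_changes(line_calls, parent_calls)
```

In this toy case, the two variants inherited from the parental variety
disappear once the parent is used as comparator, which is why a consensus
genome from a database tends to inflate the list of apparent changes.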

More generally speaking, most of these NBTs, such as grafts between GMOs
and non-GM scions, have been very quickly recognised as generating GMOs
by exchanging nucleic acids, organelles and epigenetic modifications (Hertle
et al., 2021; Stegemann and Bock, 2009; Tsaballa et al., 2021; Warschefsky
et al., 2016). Some transient expression techniques have also been shown
to induce unintended insertional modifications depending on the target
(protoplast, callus, tissues or the whole plant) (Canto, 2016; Tyurin et al.,
2020). They are also considered GMOs by their requirement of GM expression
vectors providing signatures of the method (Tyurin et al., 2020). Agroinfiltration
and floral dip, initially included in NBTs for transient expression or tissue
transformation without cell cultures, mostly lead to chimaeras, despite various
improvements, and form parts of a “new” GMO signature (Maher et al., 2020;
Matsuo et al., 2016; Zhang et al., 2020). In some cases, the need to use several
techniques simultaneously generates superimposed scars and signatures,
including in plastids, while the extracted compounds are contaminated by toxic
compounds and DNAs (Boettcher and McManus, 2015; Komarova et al., 2010;
Mao et al., 2018; Schillberg and Finnern, 2021; Tuse et al., 2020; Tyurin et al.,
2020). RNAi may result from the GMO producing, for instance, dsRNA instead
of toxic protein or sprays, targeting several non-target organisms (NTOs)
(Ivashuta et al., 2015; Khajuria et al., 2018). Due to numerous false positives,
the accurate detection of those GM products may be difficult despite their
numerous off-targets and other unintended modifications (Potrykus, 1991;
Marine et al., 2012).

Negative segregants are GMOs because they are derived from GMOs.
Therefore, they carry the original GMO scars and signatures due to cell culture
and related techniques (Bregitzer et al., 1998; Wada et al., 2020). Finally,
cisgenesis and intragenesis also lead to GMOs because they involve the in vitro
manipulation of DNA and cell cultures that did not have a long record of safety
before 2001, while containing the usual scars and signatures. Moreover, the in
vitro manipulation of DNA erases the self/non-self patterns that induce immune
responses of plant cells. The origin of the introduced sequence(s) thus does not
change the unintended effects and legal nature.

We will now focus on still disputed GMOs such as “hidden” and “new”
GMOs. From a holistic point of view covering all kinds of GMO detection, four
sources of detection are discernible, that is, (i) the selection signatures of
cultivars and germplasms, (ii) the internal signatures of sequences used by
laboratories for transforming plant cells, (iii) the scars due to the related in
vitro techniques and finally, (iv) signatures resulting from the intended and
unintended changes due to NBTs.


3.1 Domestication syndrome and selection 


The repeated traditional selection over 10 000 years has left selection signatures
of human pressures in the peasants' landraces and the companies' cultivars
(McLean-Rodriguez et al., 2021).

Indeed, the variability between Elite inbreds and correlated germplasms
can reach tens of millions of detectable SNPs to be managed for identification
and genomic selection (Flint-Garcia, 2013; Hake and Ross-Ibarra, 2015; Wilkesa
et al., 2016). The selection signatures of both Elite and commercial varieties,
which differ, organised with the numerous in vitro scars and signatures, will
constitute the first data usable in identifying the natural or artefactual status
of modifications (Flint-Garcia, 2013; Grainger et al., 2018; Hill et al., 2021;
Olsen and Wendel, 2013a,b; Slotte, 2014; Xie et al., 2015). Due to the complex
breeding schemes, whose results are more or less well predicted by modelling,
germplasm uses are generally well correlated with seed companies' habits,
available material and know-how (Beckett et al., 2017; Cobb, 2019; Glenn
et al., 2017; van Heerwaarden et al., 2012a,b). This ascending traceability is
complementary to the species typology accessible to the various techniques,
which could be part of a screening stage after integration into, e.g., decision
support systems (Jansing et al., 2019).

Integrating a new trait resulting from the genetic or epigenetic modification
of a more or less related species into a commercial line requires introducing
it beforehand into one or more of these Elite varieties. Nevertheless, the
behaviour of this new trait in an Elite variety is difficult to predict (Crossa
et al., 2017; Evangelou et al., 2019), because the genetic distance between the
Elite types may be, as for corn, comparable to the one between humans and
chimpanzees and because prediction tools are still not accurate. Possessing a
detailed knowledge of the behaviour of Elite varieties, for which only recently a
few reference genomes have been available, is therefore an absolute necessity
in plant breeding (Flint-Garcia, 2013). This gene pool remains the most
influential factor in the expression of introgressed traits, whatever the kingdom
of the living world considered (Evangelou et al., 2019; Flint-Garcia, 2013; Fu
et al., 2021; Mumm, 2013).

These selection signatures are a primary element of suspicion of a mutated
origin of diversity, available even for unknown GMOs.


3.2 NBT and transgenesis reagents delivery and expression tools 


The efficient delivery and expression of transgenic and NBT products is still
a challenge for “laboratory” cultivars and well-tried explants. Therefore,
laboratories keep using the same, somewhat mastered methods of delivery,
gene expression and transformed cell selection, together with previously
tested culture conditions and regeneration media. Altogether, the sequences
of the delivery and expression cassette constitute a signature of the biotech
laboratories, and even of the people who have transformed the isolated plant
cells (Alley et al., 2020; Nielsen and Voigt, 2018).
Agrobacterium is the most efficient delivery system, with a low number of
inserted copies and better stability of the insertions, but with evident
residual scars (Bertheau, 2019; Matsaunyane and Dubery, 2018). The DSB⁶
induced by the insertion of Agrobacterium sequences or by biolistics generates
other kinds of identifiable genomic and epigenomic scars, as do invasive
foreign DNA and gene-editing techniques. Therefore, bacterial plasmid and
chromosomal insertions, and further rearrangements or epigenetic
modifications, are usual in the modified plant pangenomes (Ahmad et al., 2020;
Anderson et al., 2016; Bortesi and Fischer, 2015; Dalla Costa et al., 2020;
Filipecki and Malepszy, 2006; Fossi et al., 2019; Gelvin, 2008; Grabowska and
Filipecki, 2004; Gorbunova and Levy, 1999; Hahn and Nekrasov, 2019; Hilscher
et al., 2017; Ito and Machida, 2015; Perrone and Martinelli, 2020; Reddy et al.,
2020; Serrano-Benitez et al., 2020; Ulker et al., 2008; Vitor et al., 2020).
Additionally, the delivering Agrobacterium strains used all belong to biovar 1,
for which a signature PCR detection test is available (Gelvin, 2003; Keane
et al., 1970; Spiridonova et al., 2009). Many of the transformation strains
contaminating the modified genomes share the same genetic background and
delivery plasmids. Such specificity can differentiate these reagents' vectors
from the scars of aborted infections by natural strains (De Saeger et al.,
2020; Tyurin et al., 2020).
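
How vector-derived sequences and insertion junctions might be flagged in
sequencing reads can be illustrated with a minimal k-mer sketch. It is only an
illustration under stated assumptions: the two reference sequences, the reads
and the k-mer size are invented toy values, and the function names are
hypothetical. A real screen would use curated vector and host references and
an established aligner.

```python
def kmers(seq, k):
    """Return the set of all k-mers of an upper-cased sequence."""
    seq = seq.upper()
    return {seq[i:i + k] for i in range(len(seq) - k + 1)}


def classify_read(read, vector_kmers, host_kmers, k):
    """Label a read by the reference(s) its k-mers match.

    'junction' means the read carries k-mers from both references, which is
    what a host/vector border fragment would look like; 'vector' or 'host'
    means a single reference matched; 'unknown' means neither matched.
    """
    read_kmers = kmers(read, k)
    hits_vector = bool(read_kmers & vector_kmers)
    hits_host = bool(read_kmers & host_kmers)
    if hits_vector and hits_host:
        return "junction"
    if hits_vector:
        return "vector"
    if hits_host:
        return "host"
    return "unknown"


# Toy sequences, invented for illustration only (not real vector or plant DNA).
VECTOR = "TTGACCATGGCTAGCAAGGTCCTAA"
HOST = "ACGTTAGCCGATTACGGCTAAGCTT"
K = 8  # assumed k-mer size; real screens would use longer k-mers

vector_kmers = kmers(VECTOR, K)
host_kmers = kmers(HOST, K)

reads = {
    "r1": HOST[:20],                   # purely host-derived
    "r2": VECTOR[5:25],                # purely vector-derived
    "r3": HOST[5:17] + VECTOR[0:12],   # spans a host/vector junction
    "r4": "GGGGGGGGGGGGCCCCCCCCCCCC",  # matches neither reference
}
labels = {name: classify_read(seq, vector_kmers, host_kmers, K)
          for name, seq in reads.items()}
for name, label in labels.items():
    print(name, label)
```

Reads labelled "junction" are the candidates worth inspecting as border
fragments; reads labelled "vector" point to backbone or plasmid residues.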

Biolistics, the other most-used delivery system for genetic and epigenetic
modification reagents, has been used for many commercially available
transgenic maize varieties. The small, fragmented inserts make it easy to
suspect this delivery system (Harkess, 2019; Latham et al., 2006; Takabatake
et al., 2020a,b; Yadava et al., 2016). The use of microinjections,
electroporation, viral vectors, polyethylene glycol, nano-vehicles, needles and
other exotic reagents as delivery methods also provides signatures, such as
chromosomal rearrangements, in nuclei and organelles for several generations,
for example, as episomes (Bertheau, 2019; Burgio and Teboul, 2020; Dalakouras
et al., 2020; Filipecki and Malepszy, 2006; Jansing et al., 2019; Lacroix and
Citovsky, 2020; Liu et al., 2019; Lv et al., 2020).

Indeed, the delivery system of DNA, RNA (interfering, messenger, etc.) or
RNP is contaminated by foreign DNA because it is prepared in other living
organisms. Therefore, this DNA can be found in the “edited” genome of modified
protoplasts, as has also been observed in animal cell cultures (Bertheau,
2019; Norris et al., 2020; Ono et al., 2019). In general, authors using
“DNA-free” reagents do not apply minimum purification criteria such as those of
the “Minimal Information for Protein Functionality Evaluation” (MIPFE) for
proteins (Raynal et al., 2014; Tan and Yiap, 2009; Zhang and Heyer, 2011).
Since even the most purified commercial enzymes are DNA-contaminated, it is
disgraceful to assert that RNP deliveries are DNA-free and that no foreign
sequences have been inserted in the plants' genomes and, more generally
speaking, that NBT techniques do not leave contaminating DNA in the genomes of
edited plants (Bertheau, 2019).


6 Double Strand Break

The success anticipated in the initial claims about OdM, genome editing by
meganucleases, floral dip and agroinfiltration did not materialise. These
techniques became relatively anecdotal with the arrival of faster, more
effective and “cheaper” techniques. The lack of reproducibility or versatility,
low conversion frequency and DNA integration are also challenging (Anderson
et al., 2008; Birch, 1997; Buehler et al., 2012; Germini et al., 2018; Housden
and Perrimon, 2016; Potrykus, 1991; Raitskin et al., 2019; Riba et al., 2017;
Sheel and Xue, 2016; Smits et al., 2019; Svoboda, 2020; Wang and Yang, 2019).
As exemplified by the Cibus herbicide-tolerant canola case, OdM-induced
modification, a 30-year-old technique, has clearly not met, up to now, the
hopes initially placed on its use for plants.

The current literature hardly retains any NBTs on plants other than the
improperly named genome editing techniques, which use nucleases such as ZFN,
TALEN and the numerous orthologs and variants of the Cas9 nucleases (dCas9,
hCas9, HypaCas9, etc.), fused or not with other enzymes' sequences (Yee,
2016). However, there are technical difficulties in applying these techniques,
which require numerous and complex optimisations of the types of reagents
and delivery methods, taking into account the targeted species and tissues,
the use of optimised codons and the choice of promoters and inhibitors, for
example, for Cas9 and gRNA expression (Moon et al., 2019). These differences
in optimisations and activity controls contribute to the diversification of
the scars and signatures of the techniques used and of the laboratories that
perform them (Alley et al., 2020; Nielsen and Voigt, 2018).

More generally, any sequence insertion in the genome (endogenous or
heterologous, and therefore trans-, cis- or intragenesis) is accompanied by
unintentional genetic and epigenetic modifications, sometimes distal, which
form identifiable patterns even for unknown GMOs (Ashapkin et al., 2016;
Filipecki and Malepszy, 2006; Kosicki et al., 2018; Ledford, 2020; Sullivan
et al., 2015). This is in line with self/non-self DNA and RNA recognition
(Monticolo et al., 2020).

In conclusion, all artefactual processes and tools, such as delivery systems,
delivered sequences, culture media and the NBTs themselves, leave numerous
traces of all that has happened during cell culture, transformation,
selection and plant regeneration. This information can be scrutinised and
managed by tools such as artificial intelligence.


3.3 Technical scars of mutageneses 


Scars are the set of genetic, epigenetic (DNA methylation and the “histone
code”) and epitranscriptomic (RNA changes) modifications associated with the
in vitro techniques applied to isolated cells. Certain scars may constitute
specific signatures of the mutagenic agents or processes used (Destefanis
et al., 2021; Filipecki and Malepszy, 2006). For instance, the Single
Methylation Polymorphism (SMP), an underused polymorphism, comprises a set of
markers transmissible in plants, as impressive as Single Nucleotide
Polymorphisms (SNPs) or MNVs⁷, with different patterns related to the genetic
characteristics (Ashapkin et al., 2016; Becker et al., 2011; Cattoni et al.,
2017; Degalez et al., 2021; Schmitz et al., 2013; Wilkesa et al., 2016).
Transgenesis and genome and epigenome editing methods share these related
techniques and their unintended effects. As seen above, the delivery systems
and delivered sequences leave numerous specific genetic and epigenetic scars.

As already mentioned, the selection of transformed cells remains topical
due to the low transformation and regeneration efficiencies. Moreover,
due to consumers' fears and the Agencies' recommendations, most authors now
attempt to remove some of the selection markers by, for example, Cre-Lox
recombination, thus leaving new specific scars (Ates et al., 2020; Bertheau,
2019; Germini et al., 2018; Jansing et al., 2019; Miguel and Marum, 2011;
Ruffoni and Savona, 2013; Volkova et al., 2020; Yau and Stewart, 2013). While
antibiotic resistance and herbicide tolerance genes are less used because of
the difficulty of meeting the risk-related demands of the Agencies and
consumers, other non-sensitive selection genes, such as phospho-mannose
isomerase, are likely to be found in Elite and commercial varieties (Que
et al., 2014; Yadava et al., 2016; Yau and Stewart, 2013). Therefore, most
authors consider it necessary to drastically increase plant transformation
efficiency or to avoid all these in vitro culture phases, but the suggested
options have not yet proven able to avoid or discard these selection
signatures of transformed cells (Hamada et al., 2017; Maher et al., 2020).

In several instances, as with the specific cultivars used for many purposes,
several protocols and expression cassettes have become almost the standard
for plant transformation and regeneration (Que et al., 2014; Yadava et al.,
2016). They induce similar scars, manageable by, for example, artificial
intelligence or other tools used in plant breeding (Alley et al., 2020; Kim
et al., 2020; Nielsen and Voigt, 2018; Tong and Nikoloski, 2021).

As chromosomal rearrangements and chromothripsis have been observed due to
related techniques, to the action of CRISPR-Cas nucleases and to somaclonal
variation (Comai and Tan, 2019; Fossi et al., 2019; Leibowitz et al., 2015,
2021), the borderline between scars and signatures may be tenuous in some
cases, even if the qualification does not matter for identifying artefactual
genetic and epigenetic modifications.


3.4 Mutageneses' signatures


By signature, we mean either an unambiguous signature, such as the border
fragment of a random insertion in transgenesis, or a convergent set of data
leading to a specific identification. It is likely, bearing in mind that there
are differences between animals and plants, that many of the known results in
animals are valid for plants. This section will examine the signatures left by
the several mutagenesis techniques applied to isolated cells - the in vivo
random mutagenesis providing exempted GMOs.


7 Multi-Nucleotide Variants

The genomes and epigenomes of organelles, such as chloroplasts, participate
in tolerance to stresses such as heat, but are rarely studied (Ahmad et al.,
2017; Liberatore et al., 2016; Wallace, 2016; Wu et al., 2020). The organelles'
scars and signatures can be used in GMO identification as memories of past
phenomena, especially since tiling microarrays allow their genomes to be
explored (Hu et al., 2020; Zmienko et al., 2011). They are already involved in
cultivar identification. However, we will focus below only on nucleus
signatures.


3.4.1 Somaclonal variation and transgenesis 


The frequencies of mutations and epimutations due to somaclonal variation are
still relatively low. According to the IAEA, it is usual to increase them up to
one million⁸ times by adding chemical or physical mutagens (Brash, 2015;
Dulieu, 2005; Kovalchuk et al., 2000). The development of in vitro techniques
has allowed such a switch from the previous in vivo techniques (Maluszynski
et al., 1995, 2000). However, as already mentioned for somaclonal variation,
the resulting in vitro mutants were effectively used in plant breeding
schemes only after the publication of the TILLING method (McCallum et al.,
2000a; Tadele et al., 2009; Kurowska et al., 2011). These mutagenesis
techniques are already covered in a previous chapter of this book.


3.4.2 In vitro-induced random mutageneses 


The signatures of various events suffered by animal and plant cells in vitro
and in vivo, for example, exposure to UVA and UVB, infrared, free radicals,
etc., can be differentiated (Doitsidou et al., 2016; Flibotte et al., 2010;
Lehrbach et al., 2017; Li et al., 2015; Roberts and Gordenin, 2014; Smith and
Yun, 2017; Vitor et al., 2020; Volkova et al., 2020; Xu et al., 2016). The
changes induced by genotoxic agents, such as benzo(a)pyrene, hydrogen peroxide,
aflatoxins, ethyl methanesulphonate (EMS)⁹, UV or N-ethyl-N-nitrosourea (ENU),
can also be mapped (Brash, 2015; Burns et al., 1986; Doitsidou et al., 2016;
Fitzgerald et al., 2017; Flibotte et al., 2010; Kim et al., 2006; Kucab et al.,
2019; Lehrbach et al., 2017; McCarthy et al., 2020; Saini et al., 2020; Segovia
et al., 2015; Sikora et al., 2011; Simon et al., 2015; Volkova et al., 2020;
Wei et al., 2021).
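
One simple way to look for such a mutagen signature is to fold the observed
single-nucleotide changes into the six strand-independent substitution classes
and to examine their proportions; an excess of GC-to-AT transitions, for
example, would be compatible with EMS. The sketch below assumes a list of
(reference, alternative) base pairs; the variant calls are hypothetical and
the function names are chosen for illustration only.

```python
from collections import Counter

COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def substitution_class(ref, alt):
    """Fold a single-nucleotide change onto the pyrimidine strand.

    G>A and C>T are the same event read from opposite strands, so both are
    reported as 'C>T', i.e. the GC-to-AT transition.
    """
    ref, alt = ref.upper(), alt.upper()
    if ref == alt or ref not in COMPLEMENT or alt not in COMPLEMENT:
        raise ValueError(f"not a single-nucleotide substitution: {ref}>{alt}")
    if ref in "AG":  # purine reference: report the complementary strand
        ref, alt = COMPLEMENT[ref], COMPLEMENT[alt]
    return f"{ref}>{alt}"


def spectrum(variants):
    """Return the fraction of variants observed in each substitution class."""
    counts = Counter(substitution_class(ref, alt) for ref, alt in variants)
    total = sum(counts.values())
    return {cls: n / total for cls, n in counts.items()}


# Hypothetical variant calls (ref, alt) from one mutagenised line.
calls = [("G", "A"), ("C", "T"), ("G", "A"), ("C", "T"), ("A", "G"),
         ("C", "A"), ("G", "A"), ("C", "T"), ("T", "C"), ("G", "A")]
profile = spectrum(calls)
gc_to_at = profile.get("C>T", 0.0)
print(profile, gc_to_at)
```

Comparing such profiles between lines, or against a somaclonal baseline, is
what would turn a raw list of variants into a candidate signature.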


8 http://www-naweb.iaea.org/nafa/pbg/mutation-induction.html
9 Transition GC-to-AT

Although unstable because they are in high demand, some molecules, such as
rDNA, could protect against mutations due to EMS and UV (Dvořáčková et al.,
2015). Large deletions can be induced at the same locations (fragile
chromosomal sites/recombination hotspots) by various mutagenesis agents
such as EMS, fast neutrons, T-DNA and X-rays, which could result from an
ancestral recombination event (dela Paz et al., 2012; Filipecki and Malepszy,
2006; Rodgers-Melnick et al., 2015). Induced random mutations can lead to
hypermutations, with some mutated sites appearing particularly unstable in cell
cultures (Durkin and Glover, 2007). Once identified, these sites, particularly
those in active gene areas, can constitute signatures of the events undergone
by the affected cells.

Several other biotic and abiotic mutagenic stresses, such as drought and
heat, induce dramatic changes that are transmitted (Fleta-Soriano and
Munne-Bosch, 2016; Jaligot and Rival, 2016; Probst and Mittelsten Scheid, 2015;
Wibowo et al., 2016). Chemical, biological and physical mutagens add a new
layer of mutations and epimutations whose types, locations and frequencies
may be differentiated from the changes induced by somaclonal variation.


3.4.3 NBT signatures 


As Stella and Montoya (2016) remind us, it is technically difficult and costly
to attempt to detect low- and medium-frequency off-targets by sequencing.
Moreover, off-target detection capabilities vary significantly for the same
sgRNA and gene-editing technique, depending on whether biased or “unbiased”
detection methods are used (Martin et al., 2016; Newman et al., 2020).
Unintended modifications also vary between clonal cells, whereas unintended
changes are searched for only in cell populations, thus diluting the
low-frequency unintended changes (Kimberland et al., 2018; Sentmanat et al.,
2018). Therefore, it is illusory to assert, as several authors and
stakeholders often do, that there would be no off-targets in a modified cell
and that they will not be found in the progeny (Kang et al., 2020; Stella and
Montoya, 2016). RNAi and ncRNA also induce many unintended effects, which can
be used to identify processes (Ramesh, 2013; Ramon et al., 2014; Svoboda,
2020).
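
The dilution effect mentioned above can be made concrete with a small
calculation. The sketch below rests on simplifying assumptions that are not
taken from the cited studies: a change present in a single cell of a pool,
reads sampled independently (binomial model), no sequencing errors, and an
arbitrary calling threshold of three supporting reads.

```python
from math import comb


def pooled_allele_fraction(n_cells, ploidy=2, mutated_copies=1):
    """Allele fraction of a change carried by one cell in a pool of n_cells."""
    return mutated_copies / (ploidy * n_cells)


def detection_probability(allele_fraction, depth, min_reads=3):
    """Probability of seeing at least min_reads variant reads at a given depth.

    Binomial sampling model: each read carries the variant independently with
    probability allele_fraction. Sequencing errors are ignored.
    """
    p_fewer = sum(
        comb(depth, k) * allele_fraction**k * (1 - allele_fraction)**(depth - k)
        for k in range(min_reads)
    )
    return 1.0 - p_fewer


# Illustrative comparison at 30x depth: one isolated clone versus a pool of
# 1000 cells in which a single cell carries the heterozygous change.
for n_cells in (1, 1000):
    fraction = pooled_allele_fraction(n_cells)
    print(n_cells, fraction, detection_probability(fraction, depth=30))
```

Under these assumptions the same change that is almost certain to be seen in
an isolated clone becomes practically invisible in the pooled population,
which is the point made about population-level searches.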

The size of the recognised/targeted sequences (on- and off-targets)
differs between nuclease-based NBTs. Systematic on- and off-targets in the
vicinity of, for example, a PAM sequence or of ZFN- and TALEN-binding sites
would constitute such a pattern signature. Additionally, targeted sequence
sizes differentiate ZFN, TALEN and CRISPR-Cas techniques (see Fig. 1 for more
details; Janik et al., 2020), as also shown, for example, by the length of the
artificial TALEN¹⁰ cleaving site (half-site 14-20 nt - FokI site 14-24 nt -
half-site 14-20 nt) (Ahmad et al., 2020; Khalil, 2020). Moreover, specific
sequences can be found near and at the enzyme cleavage and homologous sequence
sites (e.g. different Protospacer Adjacent Motifs (PAMs) for different GC
contents in genomes, zinc finger sequences for ZFNs, etc.). More detailed
studies should make it possible to define unambiguous signatures of nucleases
such as those identified for Cas9 and Cas12a (Wienert et al., 2019). The repair
of the double-stranded cut caused by Cas9 would modify the degree of CpG island
methylation in the case of Homology-Directed Repair (HDR) but not of the
Non-Homologous End-Joining (NHEJ) DNA repair mechanism (Farris et al., 2020).
Particular sequence specificities of the DSB cut sites can also be
distinguished (Molla and Yang, 2020). TALEN's FokI nuclease cuts require a T in
the 5′ terminal position. TALEN also recognizes methylated cytosines, in
contrast to CRISPR-Cas. The reagents are also differentiated by their chromatin
affinity sites (Jain et al., 2021). SMP polymorphism and the effect of genetic
modification on Differentially Methylated Regions (DMRs) have not yet been
studied in this case (Eichten and Borevitz, 2013; Li, Y. et al., 2017; Schmitz
et al., 2013). All these “subtle” but clear differences between nuclease-based
gene-editing techniques are amenable to inclusion in scars and signatures
databases. The bioinformatic screening of these zones is manageable: the
thousands of PAM sequences or sequences recognized by ZFN and TALEN to be
monitored are few compared to the tens of millions of SNPs used in GWAS¹¹
(Flint-Garcia, 2013; Li et al., 2020).


10 https://www.medecinesciences.org/en/articles/medsci/full_html/2014/02/medsci20143002p186/medsci20143002p186.html
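
The PAM-centred screening described above can be sketched in a few lines. The
example assumes the NGG motif of the commonly used Streptococcus pyogenes
Cas9, scans the forward strand only and uses an arbitrary 20 bp window; the
sequence, the variant positions and the function names are invented for
illustration. Other nucleases, the reverse strand and the ZFN or TALEN
binding sites would need their own motifs.

```python
import re


def find_ngg_pams(sequence):
    """Return the 0-based start positions of NGG motifs on the forward strand."""
    sequence = sequence.upper()
    # A look-ahead is used so that overlapping motifs (as in 'AGGG') are all
    # reported rather than only the first one.
    return [m.start() for m in re.finditer(r"(?=[ACGT]GG)", sequence)]


def variants_near_pams(sequence, variant_positions, window=20):
    """Map each variant position to the PAM starts lying within `window` bp."""
    pams = find_ngg_pams(sequence)
    return {
        pos: [p for p in pams if abs(p - pos) <= window]
        for pos in variant_positions
    }


# Toy sequence with a single NGG motif, and two hypothetical variant positions.
SEQ = "AT" * 10 + "CAGG" + "AT" * 20
VARIANTS = [25, 60]

pam_sites = find_ngg_pams(SEQ)
near = variants_near_pams(SEQ, VARIANTS)
print(pam_sites, near)
```

Variants that systematically fall next to a PAM, as position 25 does here,
would feed the pattern signature; isolated variants far from any motif, such
as position 60, would not.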

The vast number of publications on improving specificity and efficiency,
and on reducing the number of off-target or unintentional on-target
modifications, shows how widespread the accidental side effects of NBT
techniques are (Rezza et al., 2019). In most cases, we are unable to affirm
that they are not scars from in vitro cultures and related techniques (Ahmad
et al., 2020; Anderson et al., 2008; Cullot et al., 2019; Gauchier et al., 2020;
Germini et al., 2018; Kawall et al., 2020; Khan et al., 2017; Naeem et al.,
2020; Newman et al., 2020; Peng et al., 2016; Thomas et al., 2019a,b; Wienert
et al., 2019). Our inability to master these techniques is such that it is
still recommended to continue working with TALEN rather than with CRISPR-Cas
for some species (Jain et al., 2021; Khalil, 2020).

The method of delivery of nucleases may also play a role in leaving a distinct
signature or scar. For instance, a Cas nuclease that is highly specific when
expressed via a plasmid may be disappointing when delivered as an RNP (Ates
et al., 2020). Depending on the applied dose, RNPs can generate different
amounts of microdeletion signatures (Momose et al., 2018). Similarly, the
delivery of TALENs as either DNA or mRNA affects the unintentional
modifications (Jansing et al., 2019). Once again, a systematic study, on the
same model, of the three forms of reagents (DNA, mRNA and RNP) of the nucleases
and their variants should establish a typology that can be used to identify the
NBT used. Moreover, the RNAs used generally contain DNA that either
contaminates or stabilises the RNAs to be introduced (Bertheau, 2019; Jansing
et al., 2019). As already noted, the stabilising DNA from these frequent
DNA/RNA hybridizations can integrate into the genome even during transient
expression. Therefore, knowledge of reagent delivery gained through patents and
risk assessment dossiers would help identify the residues and scars of these
modifications.

The integration sites can also participate in the signature patterns. As in
transgenesis, SDN3-type integrations, which induce many unintended effects
(Schnell et al., 2015), aim to target a “safe harbour”, such as AAVS1 and
Rosa26 in animals, for stable insertion and expression. Such safe harbours are
largely unknown in plants, except possibly in one case in rice, but systematic
studies would most likely identify new ones (Altpeter et al., 2016; Cantos
et al., 2014; CGB, 2004; Papapetrou and Schambach, 2016; Perez-Gonzalez and
Caro, 2019; Schaart et al., 2016). Insertions at these locations, actively
sought by breeders, would participate in the signature of an artefactual event.
Otherwise, integrations almost always occur at rearrangement-prone, gene-rich
sites, in distal and often telomeric and subtelomeric chromosomal regions
(Filipecki and Malepszy, 2006; Svitashev and Somers, 2002). These are fragile
areas, such as telomere-derived ITRs, some of which are recombination hotspots,
or teloboxes (Aguilera and Garcia-Muse, 2013; Dvořáčková et al., 2015; Yaish,
2017). Such characteristics may also be part of the set of signatures or scars
of genetic and epigenetic changes.

Nuclease-encoding DNA insertions are still the most efficient modification
system. Such an insertion-based system necessarily requires systems regulating
gene expression (by induction or repression) to avoid further unintentional
cuts (Canny et al., 2018; Chen et al., 2017). Alternatively, Cas inhibitors may
also be inserted (Davidson et al., 2020; Marino et al., 2020). Such inhibitory
or regulatory systems can then be retrieved as signatures in the progeny of
modified plants.

The variants of the nucleases (dCas9, prime editing, base editor, etc.) do
not seem to modify the ability to identify signatures. Moreover, these variants
can also increase the frequency of off-targets (Jansing et al., 2019). Indeed,
modifications of enzymes, such as a base editor, would cause up to 20 times
more off-targets than Cas9. Some of these techniques that work well in
cell-free systems are disappointing when applied to in vitro cultures because
cellular mechanisms interfere, and many unintended alterations and off-targets
are then generated (Ahmad et al., 2020; Ates et al., 2020; Jin et al., 2019;
Zuo et al., 2019).

Careful sequencing allows the detection of other unintentional modifications
(Frock et al., 2015), which, like off- and on-target effects, depend on many
factors (Modrzejewski et al., 2018, 2019, 2020). These unintended effects, the
sources of signatures, appear more frequent than initially expected, despite
the lack of strategies to detect them (Burgio and Teboul, 2020). Their
exceptionally high number leads to what some have called “genome chaos”
(Ledford, 2020). In other cases, the number of copies that participate in the
signature may increase (Skryabin et al., 2020).

As the number of genes to be incorporated in the germplasm increases,
the associated breeding programs become more complex. It is, therefore,
desirable for plant breeders to integrate multiple traits at one time at a few
defined loci, preferably safe harbours, to facilitate subsequent breeding
efforts. This proximity of stacked traits will be in itself a sign of an
artefactual modification transmitted to the progeny. Moreover, the SDN3 or
transgenic stacks could be gathered in clusters and delivered by a stack vector
rather than being brought together by intricate crosses of Elite varieties,
which is another kind of artefactual signature (Glenn et al., 2017; Que et al.,
2014).

The combination of sequencing, off-target prediction software and other
tools, such as artificial intelligence, would allow such NBT-induced
modifications to be easily identified (Billon et al., 2020; Manghwar et al.,
2020; Ravindiran, 2018).


3.5 The issue of false positives


As with any mutagenesis technique, false positives of NBTs are numerous, while
reproducibility is not the primary aim of biotechnologists (Bortesi and
Fischer, 2015; Lomov et al., 2019). The reality of the transformation must be
verified because somaclonal variation can, in some cases, account for the
observed phenotype (e.g., tolerance to a herbicide) (Anderson et al., 2008;
Birch, 1997; Liu et al., 2012; Potrykus, 1991).

Numerous false positives and unintended effects undermine the reliability
and acceptance of these techniques, as recently observed for Cibus' canola
(Dong and Sharp, 2007; Oh and May, 2001; Ruiter et al., 2003; Voosen, 2010).
The Cibus canola was one of the herbicide-tolerant crops (“hidden” GMOs)
that led to the 2015 French lawsuit. Since 2003, Cibus seems to be the only
company to continue using the OdM technique on plants, through a proprietary
method called RTDS¹² with more than 300 patents claimed (Beetham et al., 1999;
Beetham et al., 2014). The announced success led to the commercialisation of a
sulphonylurea-tolerant rapeseed that was the subject of an intense advertising
and lobbying campaign directed at the European authorities (Editorial, 2015,
2017a; Fladung, 2016; Gocal, 2014; Hartung, 2020; Sauer et al., 2016; Songstad
et al., 2017). However, following the publication of an article describing how
to detect it (Chhalliyil et al., 2020), this canola was “downgraded” by the
company and the Canadian Agencies to the status of an in vitro culture
somaclone. This false positive adds to the doubts expressed for years by
various authors about this NBT technique's efficacy in plants (Dong and Sharp,
2007; Ruiter et al., 2003). As in the case of Cibus, companies pressed by a
concern for a rapid return on investment may even patent data that do not
correspond to the technical and claimed reality (CGB, 2004; Seymore, 2018),
which can lead to wrong tracks in detecting these products and, at the very
least, waste much time and money. Distinguishing and quickly eliminating these
false positives and negatives is essential to assess the reliability of the
candidate scars and signatures for the detection/identification of NBT
products. Currently, scientific papers and patents do not appear reliable
enough, meaning that analysts will have to gather experimental data unless
mandatory legal obligations are set out.

New techniques for modifying genomes and epigenomes require either
tissue transformations (in vivo or in vitro) or in vitro cultures of isolated
cells. The latter are generally preferred for the supposed absence or reduction
of mosaicism in regenerated plants. The superimposition of the unintentional
modifications from the different steps, with its numerous false positives or
negatives, increases the difficulty of correlating genotype and phenotype.
Therefore, interpreting the results with respect to the mode of production,
generally presented in an opaque manner in patents (Seymore, 2018), may be
doubtful, as the recent case of the Cibus herbicide-tolerant canola reminds us.
Aware of this problem, the European Patent Office (EPO) has recently clarified
the claims to be made by applicants (Part F IV 23): “If a technical feature of
a claimed plant or animal, e.g. a single nucleotide exchange in the genome,
might be the result of either a technical intervention (e.g. directed
mutagenesis) or an essentially biological process (a natural allele), a
disclaimer is necessary to delimit the claimed subject matter to the
technically produced product (see examples in G-II, 5.4.2.1, and G-II, 5.4).
For the general principles governing disclaimers see H-V, 3.5, and H-V, 4.”

In other words, due to the extreme lack of transparency of companies,
applicants will have to specify in their patent claims the techniques that
they actually used to obtain their product.


4 Transmission of detection targets towards commercial
varieties


GMO proponents generally argue that unintended changes due to GMO
production on non-recalcitrant genotypes and explants will be discarded
during the time-consuming and costly backcrosses towards Elite varieties,
which are at the head of germplasms and commercial cultivars (European Group
on Ethics in Science and New Technologies (EGE), 2021; Haut Conseil des
Biotechnologies (HCB), 2017; Scientific Advice Mechanism (SAM), 2017).
However, the claimed value of this “cleaning” step is contradicted by the wish
of breeders to transform Elite varieties directly. In any event, the currently
available data do not support the efficiency of backcrosses in discarding
unintended changes, as we will see below.


4.1 Examples of the persistence of genomic and epigenomic 
scars and signatures through the backcrosses 


Genetic and epigenetic modifications are primarily carried out on “laboratory
varieties”, whose specific explant material is easily transformable and
regenerable (Bidabadi and Jain, 2020; Birch, 1997; Rao et al., 2009). New
trait(s) of interest are then introduced by backcrossing into Elite varieties
before their transmission, through elaborate breeding schemes, into germplasm
and commercial varieties (e.g. B73 is a founder of the Stiff Stalk germplasm
pool of maize, while PH207 is a founder of the Iodent germplasm). These
“laboratory varieties” (such as Hi-II (High type II callus production) for
maize, Topaze for rapeseed, Golden Promise for barley, etc.) are quite widely
used in the same way, according to the breeding companies' practices and
know-how (Harwood, 2012; Lowe et al., 2016; Yadava et al., 2016). Therefore, it
would be possible to trace back to Elite varieties and “laboratory varieties”
using subsets of the markers used in the ISO and UPOV¹³ standards for cultivar
identification, as already seen for lab source and selection signatures (ISO,
2019a,b,c, 2015; UPOV, 2019a,b,c,d,e, 2011).

The general absence of whole-genome sequencing of transformed, Elite
and commercial varieties, and the wider weaknesses in the assessment of
genome-wide variability and in sequencing procedures, make it impossible
to judge the reality of the theoretical efficiency of genome cleaning by
backcrossing (Bayer et al., 2020; Danilevicz et al., 2020; Faino and Thomma,
2014; Golicz et al., 2020; Kang et al., 2020). Imperfect consensus genomes
are also used in “unbiased” approaches for detecting off-targets and
other unintended modifications. Sequencing depths generally remain too
shallow, while fragment assembly and sequence comparison still show many
weaknesses (Bertheau, 2019; Cleary et al., 2015; Datta et al., 2010; Steinegger
and Salzberg, 2020). Thus, the sequence biases remain too numerous to assert
that unintentional mutations are eliminated by backcrossing.
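
Why shallow sequencing limits such assertions can be illustrated with the
classical Poisson approximation of per-base coverage (the Lander-Waterman
model). This is a sketch under idealised assumptions, namely uniformly random
reads and no mapping or assembly bias, so real data will generally fare worse;
the depths and the five-read threshold are arbitrary illustrative values.

```python
from math import exp, factorial


def fraction_below_depth(mean_depth, min_depth):
    """Expected fraction of bases covered by fewer than min_depth reads.

    Assumes read starts are uniformly random, so that per-base depth is
    approximately Poisson-distributed with the given mean.
    """
    return sum(
        exp(-mean_depth) * mean_depth**k / factorial(k)
        for k in range(min_depth)
    )


def expected_bases_below(genome_size_bp, mean_depth, min_depth):
    """Expected number of bases with fewer than min_depth supporting reads."""
    return genome_size_bp * fraction_below_depth(mean_depth, min_depth)


# Share of positions that would have fewer than 5 reads at several mean depths.
for depth in (5, 10, 30):
    print(depth, fraction_below_depth(depth, min_depth=5))
```

Even under this optimistic model, a mean depth of 10 leaves roughly 3% of
positions with fewer than five reads, and any unintended change located there
would go unexamined.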

The national and regional agencies require several plant generations to
assess the stability of the genomes of “transformation events” in risk
assessment. However, seven backcrosses are generally considered necessary, in
an ideal genetic context (i.e. with only sequences segregating according to
Mendelian laws), to achieve 96-98% genome similarity between the “converted”
variety and the original Elite line, which would then be near-isogenic¹⁴,
according to the accepted expression. Even when backcrossing supposedly fully
independent traits or genes (i.e. without haplotypes, etc.) up to the
theoretical limit of about 98% varietal purity, more than 500 million base
pairs in the case of wheat would not be affected by this genome “cleaning”
(Baltes et al., 2017; Bertheau, 2019; Glenn et al., 2017; Hedman et al., 2018;
Holland, 2007; Hollick, 2017; Kumar et al., 2014).


13 https://www.upov.int/meetings/en/details.jsp?meeting_id=50791

Generally speaking, the number of backcrosses needed to approach this
theoretical purity varies according to numerous factors such as (i) the size of
the haploid genomes, (ii) the pangenomes and (iii) degrees of ploidy, (iv) the
alleles and pseudoalleles involved, (v) the number of known traits to be
introgressed and conserved, (vi) the sizes and loci present in the
non-Mendelian segregated regions, (vii) the proximity between the trait to be
introduced and those to be eliminated (linkage drag), (viii) the type of gene
involved (plant defence reaction genes), (ix) the autogamy or not of the
species or (x) the meiotic drive, among others (Ashapkin et al., 2016;
Bertheau, 2019; Dulieu, 2005; Flint-Garcia, 2013; Flint-Garcia et al., 2003;
Gao et al., 2016; Glenn et al., 2017; Golicz et al., 2020; Liu et al., 2018;
Kumar et al., 2014; Rodgers-Melnick et al., 2015; Xue et al., 2019). This
“purification” of the genome by backcrossing is, finally, more complicated
with non-dominant traits; hence the use of selection by molecular markers and
of in vitro techniques, such as immature embryo cultures, to avoid or simplify
this lengthy and costly backcrossing. This long enumeration simply serves as a
reminder of some of the very many factors affecting the theoretical value
stated for backcrossing. In most cases, genome cleaning by backcrossing will
remain well below this theoretical value, as most of these elements are not
carefully evaluated despite the companies' know-how. Thus, in the majority of
cases, many scars will be conserved and passed on to subsequent generations,
i.e. towards the cultivar.

The lack of standardised binding guidelines for examining unintended
modifications and their subsequent elimination or transmission is a significant
shortcoming of applied mutagenesis. As Jansing15 reminds us regarding the
detection and elimination of unintended changes, “It is a question of which
method you have the equipment and the know-how for.”

Accordingly, numerous examples show that the Elite and commercial
varieties can still incorporate the scars and signatures of the transformed
lines. The profiles of the recombination hotspots of these “laboratory
varieties” (recombination rates vary by a factor of 1000 within the maize
genome) can be established (Rodgers-Melnick et al., 2015; Rodgers-Melnick
et al., 2016). Changes in sites with low recombination rates would thus signal
artefactual changes. Despite its technical weaknesses, variety sequencing can
trace breeding history through SNPs, and probably SMPs and MNVs (Liu, Z. et al.,
2020; Wachsman et al., 2017; Wilkesa et al., 2016).


14 Near-isogenic lines are strains whose genetic makeups are identical except for a few specific locations or genetic loci.
15 https://www.the-scientist.com/lab-tools/new-methods-to-detect-crispr-off-target-mutations-30013

More prosaically, scars can be preserved in the offspring when an insufficient
number of backcrosses is carried out, for reasons of cost and of market
conquest or conservation (Briggs, 1938; Glenn et al., 2017; Luna and
Dowd-Uribe, 2020). Genome cleaning by backcrossing is also practically
impossible for slow-growing plants such as trees. It is also inapplicable to
vegetatively propagated plants, for which various alternatives are being tested
(Ahmad et al., 2020).

As the “memory” of water stress reminds us, many of these modifications,
unintentional and even of somatic origin, are transmitted to the descendants
and are therefore usable in detection (Hofmeister et al., 2020; McCarthy
et al., 2020; Plomion et al., 2018; Wang et al., 2019). Unlike in animals,
somatic mutations in plants can be transmitted to the progeny (Schoen and
Schultz, 2019; Simberloff and Leppanen, 2019), while these spontaneous
mutations are dependent on the parents of the hybrids (Bashir et al., 2014).
Conversely, the heritability of the modifications due to CRISPR-Cas9, like its
efficacy, varies (Mao et al., 2017). More generally, the estimated heritability
of the traits of interest, and of the unintended changes, is highly variable,
which weakens what is expected from the theoretical efficiency of backcrosses
(Visscher et al., 2008).

The intergenerational transmission of unintentional changes also applies
to epigenomes (Sharma, 2013). Indeed, DNA methylation polymorphism due
to in vitro cultures can be preserved after more than six backcrosses (Filipecki
and Malepszy, 2006; Kumar et al., 2014). Conservation over 13 generations
of modifications expected to be transient is also observable in animals
(Svoboda, 2020). Part of the non-Mendelian heritability, such as the maternal
effect, is now attributed to transmissible RNAs, which are never reported in
risk-evaluation studies. These paramutations are responsible for at least some
transmissible epigenetic regulation (Bonduriansky and Day, 2009; Grandjean
et al., 2013; Liebers et al., 2014; Roach and Wulff, 1987). Moreover, it has
long been recognised that methylation blocks, the differentially methylated
regions (DMRs), present patterns that are characteristic of species and
cultivars and influenced by the cells' history (Ashapkin et al., 2016; Eichten
and Borevitz, 2013; Eichten et al., 2013; Li, Y. et al., 2017; Xu et al., 2020).
The degree of DNA methylation is one of the causes of the recombination
hotspots, self-recognition, and signature of sequence insertions, as mentioned
earlier.

Genome-wide association studies (GWAS) allow the analysis of the many past
events that have affected genomes. New elements are expected to become
available with the sequencing in progress (1000 and 10 000 genome projects),
followed by the 1000 epigenomes already proposed (Ashapkin et al., 2016; Koch,
2016; The 1001 Genomes Consortium, 2016; Van de Peer and Pires, 2018). The
resulting evolutionary selection signatures16 can then differentiate between
natural and artefactual mutations (Ayalew et al., 2020; Stephan, 2019; Xie
et al., 2015). More generally speaking, the cell lines can be distinguished by
the inheritance of certain traits almost independently of (back)crosses
(Bertheau, 2019; Braatz et al., 2017; Burian et al., 2016; McKenna et al., 2016;
Yuan et al., 2018a,b). These modifications can be so well preserved that the
laboratory at the origin of the induced changes can be identified (Alley et al.,
2020; Nielsen and Voigt, 2018).

These scars and signatures, which help identify Elite varieties and cultivars
and their past despite (back)crosses, were previously mentioned in connection
with inbred lines, where they are mostly due to breeders' habits, know-how and
available material. Such biomarkers are part of the variety identification
methods currently standardised at ISO and UPOV.


4.2 Distinguishing natural and non-natural modifications 


One of the arguments often put forward against the regulation of NBT products
is the inability to distinguish natural mutations, i.e. mutations that appear
spontaneously without direct human intervention on the cells, from artefactual
modifications caused by the techniques behind GMOs, whether exempt or not;
this supposedly impossible distinction might thus make the regulation
inapplicable (European Group on Ethics in Science and New Technologies (EGE),
2021; Haut Conseil des Biotechnologies (HCB), 2017; Scientific Advice Mechanism
(SAM), 2017). However, these views do not consider the genomes and epigenomes
from a holistic viewpoint but rather at the level of their individual
constituents. Several arguments have been provided above for differentiating
artefactual from natural changes if the context is taken into consideration.
Moreover, in vitro mutations can be distinguished from in vivo-induced changes
by the lower frequency of the organisms' chimaerism and mosaicism (Frank and
Chitwood, 2016).

Biotic and abiotic stress-induced mutations and epimutations are
transmissible and distinguishable from the spontaneous ones (Ashapkin et al.,
2016; Bednarek and Orłowska, 2020; Editorial, 2017b; Goldschmidt, 2014;
Hahn and Nekrasov, 2019; He and Li, 2018; Kellenberger et al., 2018; Li, S.
et al., 2017; Niederhuth et al., 2016; Perrone and Martinelli, 2020; Quadrana
and Colot, 2016; Rath, 2018; Shahryary et al., 2020; Springer and Schmitz,
2017; Vitti et al., 2013; Yaish, 2017).

The spontaneous mutation frequencies, neutral or not, classically reported
would be 107 to 1075, depending on the sequence considered. However, it is
essential to clearly distinguish the mutation rates observed during experiments
from the substitution rates, which are retained changes, a confusion often made
in expert reports (Exposito-Alonso et al., 2018). The substitution rates in
plants are between 1075 and 107? per site and generation, independent of the
plants' annual or perennial character (Dulieu, 2005; Exposito-Alonso et al.,
2018; Whittle and Johnston, 2003). Moreover, the somatic mutations transmitted
from the hybrids are dependent on the parents (Bashir et al., 2014; Dulieu,
2005; Schoen and Schultz, 2019). In conclusion, the frequency of conserved
spontaneous mutations is about one point mutation per haploid genome per
generation, against almost 700 epimutations per generation, which allows us to
understand the excellent conservation of the tree genomes previously mentioned
(Ashapkin et al., 2016; Kuhlemeier, 2017).


16 Signatures of selection are defined as the reduction, elimination or change of genetic variation in genomic regions
that are adjacent to causative variants in response to natural or artificial selective pressure.
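
The link between a per-site substitution rate and a per-genome count is a
simple product, which the following sketch makes explicit. It is only an
illustration: the function names are hypothetical, and the numerical values in
the usage lines are placeholders chosen for round arithmetic, not rates or
genome sizes taken from the cited studies.

```python
def expected_changes_per_genome(rate_per_site: float,
                                haploid_genome_bp: float) -> float:
    """Expected number of retained changes per haploid genome per generation.

    rate_per_site: substitution rate per site and per generation.
    haploid_genome_bp: number of sites (base pairs) in the haploid genome.
    """
    if rate_per_site < 0 or haploid_genome_bp < 0:
        raise ValueError("inputs must be non-negative")
    return rate_per_site * haploid_genome_bp


def epimutation_ratio(epimutations_per_generation: float,
                      mutations_per_generation: float) -> float:
    """How many epimutations are expected for each point mutation."""
    if mutations_per_generation <= 0:
        raise ValueError("mutations_per_generation must be positive")
    return epimutations_per_generation / mutations_per_generation


if __name__ == "__main__":
    # Placeholder inputs (assumptions): 1e-8 per site, 1e8 bp genome.
    print(expected_changes_per_genome(1e-8, 1e8))
    # Orders of magnitude quoted in the text: ~700 epimutations per mutation.
    print(epimutation_ratio(700, 1))
```

With the orders of magnitude quoted above, epigenetic changes would outnumber
point mutations several hundredfold per generation, which is why they are
attractive as additional detection targets.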

The molecular traces left in progeny define and identify histotypes, cell
stratifications and lineages (Benitez et al., 2018; Bertheau, 2019; Burian
et al., 2016; Ledford, 2017; McKenna et al., 2016; Scheres, 2001; Zhou et al.,
2020). Several tissue markers are thus able to trace the history of cells across
generations. These induced markers have been effectively used in plant
breeding since the 2000s, with the development of the TILLING technique and
the corresponding variety identification by markers and selection signatures
(Anderson et al., 2016; Gregory et al., 2008; Till et al., 2009; Flint-Garcia,
2013; McCallum et al., 2000a,b; Unterseer et al., 2014).

In summary, any stress, biotic or abiotic, induces genetic and epigenetic
modifications that are transmitted to the descendants. The rest of this chapter
aims to identify these unintentional modifications, verify that they are
transferred to the descendants even in commercialised plants, and organise the
resulting data to determine the corresponding GMOs and the technique behind
them.

Epigenetic scars and signatures accompany all genomic scars and
signatures of transgenic GMOs but are never documented in the risk assessment
files (Ashapkin et al., 2016; Ben Ali et al., 2014; Doerfler, 2019; Filipecki
and Malepszy, 2006; Vilperte et al., 2016; Waminal et al., 2013), while
epimutations are known to favour mutations and vice versa (Mendizabal et al.,
2014; Ritter and Niederhuth, 2021). As Doerfler (2019) pointed out, the genomic
and epigenomic sequelae of manipulations, whether temporary or
transgenerational, have never been systematically investigated, although they
induce susceptibility to genetic mutation (Mendizabal et al., 2014).

As previously stated, current risk assessment has remained at the genetic
knowledge level of the 1970s. In conclusion, a holistic view of genomes and
epigenomes can easily differentiate traits arising from natural changes from
traits due to artefacts.


5 Gathering the data into manageable decision tools


Identifying these signatures and grouping them into coherent sets is a science
that remains to be developed despite the matrix approach's routine successes
(see Chapter 4). The organisation and use of these coherent sets would be based
on the multiparametric combination of signatures and scars, following the
biometric strategies17 of the widely used facial, iris or fingerprint
recognition methods (Bertheau, 2019; Liew et al., 2005; Menotti et al., 2015;
Unar et al., 2014).


5.1 Databases and decision tools 


In the case of NBT products, this screening could integrate the claimed traits
or even, depending on the databases built and the feedback received, other
targets such as factors identifying families of varieties, e.g. breeding
signatures. The detection of other molecular targets to verify the artefactual
nature of the NBT product would only be implemented occasionally, in the event
of a challenge to the GM nature of the detected product. The convergent body of
evidence, possibly integrated into decision support systems, would make it
possible to remove any ambiguity regarding natural or artefactual mutations.
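
How several independent markers might converge into one decision can be
sketched as a weighted score, in the spirit of multiparametric biometric
matching. This is a minimal illustration only: the marker names, weights and
threshold are hypothetical and would have to be derived from validated
databases, and the scoring rule is an assumption rather than a method described
in this chapter.

```python
from dataclasses import dataclass
from typing import Sequence


@dataclass(frozen=True)
class Marker:
    name: str        # e.g. a scar, a signature or the claimed trait
    weight: float    # hypothetical evidential weight of this marker
    observed: bool   # True if the marker was detected in the sample


def artefactual_score(markers: Sequence[Marker]) -> float:
    """Share of the total weight carried by the markers actually observed."""
    total = sum(m.weight for m in markers)
    if total == 0:
        return 0.0
    return sum(m.weight for m in markers if m.observed) / total


def classify(markers: Sequence[Marker], threshold: float = 0.5) -> str:
    """Label the sample once the converging evidence passes the threshold."""
    if artefactual_score(markers) >= threshold:
        return "artefactual"
    return "undetermined"


if __name__ == "__main__":
    sample = [
        Marker("claimed_trait", 2.0, True),
        Marker("in_vitro_culture_scar", 1.0, True),
        Marker("vector_fragment", 1.0, False),
    ]
    print(artefactual_score(sample), classify(sample))
```

A real decision support system would replace this fixed threshold with rules
or statistical models calibrated on reference materials.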


5.2 Routine detection of “hidden” transgenic and “new” GMOs


Ideally, the identification and quantification of all kinds of GMOs should be
performed without any additional cost. Thus, the best case would be that
notifiers provide the reference material and methods to identify and quantify
their products according to the current legislative framework on GMOs. We will
consider below the worst case, that of unapproved GMOs, by considering all the
biomarkers made available through the above text. A very simplified routine
scheme is proposed in Fig. 2, using the matrix approach already described in
another chapter in this book.

The situation for detecting unknown new GMOs is essentially similar to that
for transgenic GMOs and biothreats targeted by biodefense (Hedman et al., 2018;
Walper et al., 2018). The European experience with unknown transgenic GMOs
proves its worth. Knowledge management and the matrix approach are already
functioning (Arulandhu et al., 2016; Barbau-Piednoir et al., 2015; Cankar
et al., 2008; Chaouachi et al., 2005; Debode et al., 2019; Fraiture et al.,
2020; Gerdes et al., 2012a; Holst-Jensen et al., 2013; Holst-Jensen et al.,
2011; Holst-Jensen et al., 2012; Milbury et al., 2009; Nesvold et al., 2005;
Prins et al., 2016; Ruttink et al., 2010; Tengs et al., 2010). The scientific
and technological watch already plays a significant role in detecting
uncontrolled dissemination, but NGOs and whistle-blowers are also essential and
welcome sources of information. It is up to the ENGL and the third countries'
scientists to take up these “new” and “hidden” GMOs in order to develop the
necessary body of knowledge, as previously done for transgenic GMOs, and then
to pass the validated methods on to routine labs for compliance with the rules
of the Cartagena Protocol on Biosafety.


17 https://towardsdatascience.com/biometric-authentication-methods-61c96666883a
Accordingly, from a practical viewpoint, the routine labs would only have
to use a quantitative PCR targeting the claimed trait(s). In the event of a
dispute on the natural or artefactual origin of the feature, they could refer to
the enforcement labs' validation. It is only in the very few foreseeable cases
of an intense dispute that the routine labs would have to perform some
additional PCRs on a few scars or signatures of the presumed GMO. The analysis
costs would thus not increase drastically because, after a very probable phase
of tests by the operators to assess the methods' robustness, the results of the
matrix approach identification scheme will not be questioned for long.


5.3 Signatures and scars in processed products 


Processing often turns food products into a mixture of components, making
species no longer recognisable while reducing the size of the extractable
nucleic acids. As GMO detection has proven so far, the biomarkers (species
present) can be detected and, above all, organised and used in a matrix
approach. Identifying a GMO in a food processed from several species means that
the analysts have to manage a larger matrix than previously. Design of
experiments (DOE), machine learning, artificial neural networks, fuzzy logic and
genetic algorithms are some of the available tools to manage the big data that
knowledge matrices could become (Alley et al., 2020; Nielsen and Voigt, 2018;
Sivarajah et al., 2017; Yang et al., 2020; Yin et al., 2017). The management of
millions of SNPs used in genomic selection shows that this is easily manageable.
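
The matrix reasoning for a multi-species product can be sketched as a
comparison between the set of targets detected in a sample and the set of
targets expected for each known event. The event and target names below are
hypothetical placeholders, and the subset rule is a simplifying assumption (no
detection limits, no quantification) meant only to show the principle.

```python
from typing import Dict, List, Set

# Hypothetical knowledge matrix: each known event and its expected targets.
KNOWLEDGE_MATRIX: Dict[str, Set[str]] = {
    "event_A": {"species_maize", "promoter_X", "junction_A"},
    "event_B": {"species_soy", "promoter_X", "scar_B"},
}


def compatible_events(observed: Set[str],
                      matrix: Dict[str, Set[str]]) -> List[str]:
    """Events whose expected targets were all detected in the sample."""
    return sorted(name for name, targets in matrix.items()
                  if targets <= observed)


def unexplained_targets(observed: Set[str],
                        matrix: Dict[str, Set[str]]) -> Set[str]:
    """Detected targets that no compatible known event accounts for."""
    explained: Set[str] = set()
    for name in compatible_events(observed, matrix):
        explained |= matrix[name]
    return observed - explained


if __name__ == "__main__":
    detected = {"species_maize", "promoter_X", "junction_A", "scar_Z"}
    print(compatible_events(detected, KNOWLEDGE_MATRIX))
    print(unexplained_targets(detected, KNOWLEDGE_MATRIX))
```

Targets left unexplained by the known events are the ones that would point to
an unknown or unapproved product and trigger further analysis.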

Figure 1 of the previous chapter and Fig. 2 present very simplified data
analysis schemes for identifying the GMOs of an agri-food product. Some
determination criteria, such as epigenetic and epitranscriptomic modifications,
have not been incorporated, for ease of reading. However, the principle remains
the same: after the data have been organised, the analyses by dichotomous
choices classically used in the detection of transgenic GMOs will make it
possible, with the help of decision support tools (tables, DSS, artificial
intelligence), to detect an NBT product and to verify that the traits used in
the detection and quantification are not natural because they are integrated
into an artefactual context, the unequivocal signature of GMOs. The convergent
body of evidence given as an example in Fig. 2 applies particularly to the case
of an SDN1 or SDN2 modification; an SDN3 change would be identified by the
unequivocal signature of the insert's edge fragments, as for transgenic GMOs.
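
The dichotomous logic described above can be written as a short chain of
yes/no questions. The sketch below is a deliberately simplified illustration
and not the scheme of Fig. 2: the function name, the labels returned and the
minimum number of context markers are all assumptions.

```python
def classify_sample(trait_detected: bool,
                    junction_detected: bool,
                    context_markers_detected: int,
                    min_context_markers: int = 2) -> str:
    """Walk through dichotomous choices for a presumed NBT product.

    trait_detected: the claimed trait sequence was found (e.g. by qPCR).
    junction_detected: an insert/genome edge fragment was found.
    context_markers_detected: number of scars or signatures found around
        the trait (hypothetical count, not a validated criterion).
    """
    if not trait_detected:
        return "claimed trait not detected"
    if junction_detected:
        # Insert edges give a direct signature, as for transgenic GMOs.
        return "insertion signature (SDN3 or transgenic)"
    if context_markers_detected >= min_context_markers:
        return "SDN1/SDN2 candidate in artefactual context"
    return "origin undetermined, further targets needed"


if __name__ == "__main__":
    print(classify_sample(True, False, 3))
```

In practice each branch would rest on validated methods and on the knowledge
matrix rather than on a fixed count of markers.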


5.4 The issue of reference material 


The availability of reference material could face two issues.

The first concerns the availability itself. Companies could refuse to provide
reference material, arguing that third countries do not consider the
products of NBT as GMOs. Contentious situations will probably arise rapidly,
until consumers' attitudes towards those GMOs become clear enough and the
companies consider the opportunities of the European markets relevant.

The second concerns the nature of the reference material. 

The recent trend to develop alternative cost-effective and replicable
reference materials, such as plasmids or amplicons, will hamper GMO detection
capabilities. The newly defined GMOs need to be analysed in terms of the entire
genome, and probably later the epigenome or even the epitranscriptome.
Only DNA and RNA directly extracted from whole plant tissue could provide
the several selection signatures, and the scars and signatures related to the
mutagenesis technique, required by the several analyses to be performed in case
of litigation. This need was already discernible for transgenic GMOs when
considering the whole set of targets for the matrix approach (Ben Ali et al.,
2018; Collonnier and Bertheau, 2004; Gruden et al., 2013; Taverniers et al.,
2013). This general need to preserve reference materials comprising a
complete, original genome, and not error-prone PCR-amplified copies lacking
specific epigenetic patterns, will be just as prevalent with non-transgenic
GMOs. Some promising screening techniques will facilitate these matrix
approaches (Takabatake et al., 2020a,b).

Therefore, it is fundamental that the EU should reject, by appropriate
regulation or recommendation, any reference material that does not provide
a complete representation of the organism's genome and epigenome, sensu
lato. This obligation would make it possible to avoid the misadventures recently
experienced with certain reference materials.


6 Proof of concept 


We have provided numerous examples and references showing that the
identification of artefactually induced or introduced traits could identify any
GMO without additional cost in routine analysis. However, experimental proof is
still needed to support these considerations.

At least two independent approaches are available to build up the proof of
concept that scars and signatures of raw plant products can identify the
natural, artefactual or mixed origin of a presumed GM product. The first one
consists of an experimental approach in which differences will be sought by
ultra-deep sequencing between the presumed GMO and its closest supposed
relative, to establish its identification/quantification profile. The ideal is a
series of reproducible, in extenso comparisons of different techniques on the
same model organism. Data accumulation could also be achieved by comparing
the numerous transgenic GMO-certified reference materials and their
corresponding quasi-isogenic varieties. NBT products would then be added,
such as the non-browning GM fungus from Pennsylvania State University, obtained
by in vitro transient expression of a CRISPR-Cas insert in protoplasts (data
are not available to confirm that there is no unintended insertion).
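
The core of this first approach, a comparison between the presumed GMO and its
closest supposed relative, amounts to looking for variants that are private to
the candidate. The sketch below shows that idea on already-called variants; the
tuple layout and function names are assumptions made for illustration, and real
data would first need alignment, variant calling and error filtering.

```python
from typing import Iterable, List, Tuple

# A variant is represented here as (chromosome, position, ref, alt).
Variant = Tuple[str, int, str, str]


def private_variants(candidate: Iterable[Variant],
                     relative: Iterable[Variant]) -> List[Variant]:
    """Variants found in the candidate but absent from its closest relative."""
    return sorted(set(candidate) - set(relative))


def shared_fraction(candidate: Iterable[Variant],
                    relative: Iterable[Variant]) -> float:
    """Fraction of the candidate's variants also present in the relative."""
    cand = set(candidate)
    if not cand:
        return 0.0
    return len(cand & set(relative)) / len(cand)


if __name__ == "__main__":
    presumed_gmo = [("1", 100, "A", "G"), ("1", 200, "C", "T")]
    closest_line = [("1", 100, "A", "G")]
    print(private_variants(presumed_gmo, closest_line))
    print(shared_fraction(presumed_gmo, closest_line))
```

The private variants are only candidates for scars or signatures; their
artefactual origin still has to be supported by their genomic context.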

The first products applying for NBT recognition might allow the study of
the effects of several types of techniques. However, it has to be kept in mind
that all those NBT products have been obtained by Agrobacterium delivery,
with an additional in vitro stage, and are therefore, for these two reasons,
above all GMOs. This is indeed the case for the DP-915635-4 maize to be
examined by EFSA. DP-915635-4 is essentially a GMO for which CRISPR-Cas9
was used in one of the processing steps to facilitate further introgressions.
Those case studies could include: (i) the Calyxt soybean accepted as a “null
segregant” via a TALEN insertion mediated by an Agrobacterium insert, (ii)
the Del Monte “Rose” pineapple GMO with its pineapple and tangerine genes
inserted via Agrobacterium,18 (iii) the Arctic® Fuji non-browning apple from
Okanagan Specialty Fruits Inc., obtained by RNA interference and “cisgenesis”
via Agrobacterium plasmid insertion, (iv) the “cisgenic” alfalfa KK179 from
Monsanto and Forage Genetics International, with altered lignin content via
insertions by Agrobacterium, or finally (v) the antibrowning19 potato lines
Innate™ from Simplot Co., obtained through “intragenesis” and RNA interference
mediated by several sequences inserted via Agrobacterium, potato lines that
take over from an already long series of failures.20 All these products show
scars resulting from in vitro cultures and, for the most part, from the effects
induced by transformation with a modified Agrobacterium plasmid, itself a
stress inducer through double-stranded DNA cleavage, a biological mutagenesis
tool as used for the Arabidopsis insertional mutagenesis collection by INRA
Versailles (Gelvin, 2017; Kleinboelting et al., 2015).

A second approach would gather the curated sequences of all GMO types
present in the international databases. These big data could then be analysed
by species and by mutagenesis category to distinguish the similarities and
differences, at least genetic, with respect to species not mutated by man.
Despite the numerous errors present in the sequence databases (Bertheau, 2019;
Steinegger and Salzberg, 2020; Tang, 2020), the large number of sequences
available, whether or not from GMOs, should, with a reasoned use of various
statistical and DSS software and artificial intelligence, make it possible to
distinguish scars and signatures (Alley et al., 2020; Block et al., 2013; Cadzow
et al., 2014; Guillot et al., 2014; Interdonato et al., 2020; Koumakis, 2020;
Nielsen and Voigt, 2018; Yang et al., 2020). Finally, some experiments should
be enough to demonstrate the universality of the concept.


18 After ö years of work.

19 Less prone to bruising and black spot browning, with a lower content of asparagine, which turns into acrylamide
at very high temperatures.

20 https://en.wikipedia.org/wiki/Genetically_modified_potato
7 Conclusion


Non-exempt GMOs are derived from mutagenesis techniques that had not
been proven safe for an extended period before 2001. In vitro somaclonal
variation and random induced mutagenesis release well-known, differentiable
GMOs. NBTs are a heterogeneous set of techniques ranging from backcrossing
between GM and non-GM varieties to grafting between GMOs and non-GMOs, and
cell transformation by nucleic acids with or without nucleases.
The circulating molecules and the genomic and epigenomic residues are crucial
elements in the analytical traceability of these GMOs. Transgenesis and genome
editing techniques share common basic in vitro techniques and biases which,
as with other NBT techniques, currently make their products GMOs.

The SDN1,21 SDN2 and SDN3 (Sprink et al., 2016) modifications do
not fundamentally differ from each other. They all rely on natural and
error-prone DNA repair processes (Non-Homologous End-Joining (NHEJ),
Microhomology-Mediated End-Joining (MMEJ) or Homology-Directed Recombination
(HDR)) that are not mastered by related empirical techniques such as cell
transformation conditions. These NBT techniques induce off- and on-target
errors, from point mutations and indels to structural chromosomal mutations.
The SDNs generate numerous genetic stresses at the origin of, for example,
(i) transpositions of mobile elements, (ii) distal genetic modifications,
(iii) epimutations, (iv) copy number variations and (v) integration of plasmid
vector and chromosome sequences, such as those of Agrobacterium (the most
efficient and still most widely used vector), or of biolistically fragmented
inserts transcribed in different sizes. SDN3 modifications are the holy grail
for users of these nuclease techniques, a goal that is difficult to achieve
because they are highly dependent on cell conditions (cell history, phase and
origin, transformation and culture conditions), all of which contribute to the
creation of specific scars and signatures.

Off-targets are generally poorly detected because their search is biased
by the misuse of imperfect prediction software and comparator genomes or by
inappropriate prediction and NGS.22 The numerous modifications and
combinations of the various enzymes (hifiCas9, dCas9, nickase, base
editing, prime editing, Cas12a, Cas12b, etc.) make it even more complicated
to predict unintended effects, which nevertheless always leave scars and
signatures. Moreover, some dramatic unintended changes such as chromothripsis
appear to be induced by CRISPR-Cas9 and related techniques.

As Germini et al. (2018) pointed out, the comparison of techniques on
the same plant models has unfortunately never been carried out. It would,
however, shed light on the types of scars and signatures of all NBT examples.
It would allow scientific verification of whether the matrix approach to scars
and signatures enables the identification of NBT products and their techniques,
following the example of what is done at ISO and UPOV to identify varieties.
These same tools and strategies, used for years by firms, can manage several
million SNPs, which gives an idea of their capacity; they are adaptable to SMPs
and MNVs and usable in identifying GMOs of all kinds.


21 Sequence-directed nuclease (instead of the misleading “site”-directed nuclease).
22 Next-generation sequencing.

It is almost certain that efficient management of the numerous different
characteristics of GMOs, from random mutagenesis to genome editing,
would ultimately allow analysts to identify the technique at the origin of each
GMO.

While we are only beginning to be concerned about detecting products
from techniques that are about a decade old, new candidates could once
again shake up the landscape of genome editing. The question of whether
other elements of genome integrity preservation, such as the new “retrons”, the
old “targetrons” or the Cre-Lox recombinase system, will still be relevant in
the context of a “retrohoming”23 with restricted or problematic fields of
application comes to the forefront (Andrusaite and Milling, 2020; Cech, 1990;
Enyeart et al., 2013; Lambowitz and Zimmerly, 2004; Millman et al., 2020;
Pennisi, 2020a,b; Vakulskas and Behlke, 2019).

Such a proliferation of molecular tools, modified in all directions by
numerous laboratories without rigorous controls or reproducibility studies,
pleads for a strict application of the precautionary principle. All the
scientific and technical elements for reliable traceability of “hidden” and
“new” GMOs are available; only the political will to implement this
traceability, required by consumers, is missing. But the problem with NBT
products is probably more economic than a matter of unintended environmental or
health effects and modifications. Not considering NBT products as GMOs would
mean not having to provide the sequences of the inbred or modified traits and
their origin. This absence of control by risk assessment agencies would allow
these companies to benefit from the digital information available in private
and public sequence databases without having to share the benefits with
other countries and communities, especially indigenous ones, according to
the International Treaty on Plant Genetic Resources for Food and Agriculture
(ITPGRFA) agreements, which are part of the biodiversity protocol. Without
access to these detectable sequences and analysis by the CBD, countries and
communities would be hard-pressed to prove that their genetic resources
have been plundered.


23 https://www.vectalys.com/blog/lentiflash-is-a-fast-and-safe-gene-editing-delivery-technology-for-cre-loxp-models
8 Where to look for further information


Beyond considerations of risk and application of the precautionary principle
claimed by the European Union
(https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=LEGISSUM%3Al32042),
understanding the political and economic stakes underlying the absence of
reactions from the European Commission between 2007
(https://www.wur.nl/en/Publication-details.htm?publicationId=publication-way-333838353939)
and 2020 is essential to understand its repeated refusal of the ENGL's
proposals to work on the analytical traceability of NBT products
(https://www.infogm.org/7228-commission-a-refuse-travail-experts-tracabilite-nouveau-ogm,
https://ec.europa.eu/food/plants/genetically-modified-organisms/new-techniques-biotechnology_en,
https://publications.jrc.ec.europa.eu/repository/handle/JRC67059,
https://publications.jrc.ec.europa.eu/repository/handle/JRC95246,
https://publications.jrc.ec.europa.eu/repository/handle/JRC69121).

The position of the Commission and of some Member States seems to be a
faithful reflection of the positions of the seed industry and other lobbies,
who claim that any labelling would be discriminatory
(https://www.nbtplatform.org/,
https://www.sciencedirect.com/journal/new-biotechnology/vol/40/part/PA,
https://www.europeanscientist.com/en/features/the-place-of-europe-in-the-new-plant-breeding-landscape-evolution-of-field-trials/,
https://www.europabio.org/genome-editing-faq/,
https://onlinelibrary.wiley.com/doi/full/10.1111/ropr.12366,
https://euroseeds.eu/the-socio-economic-and-environmental-value-of-plant-breeding-in-the-eu-key-findings/).

The current absence of analytical methods to control GMO and non-GMO
products will hamper the ability to segregate products and therefore to ensure
the coexistence of GMO and non-GMO supply chains for products
intended for food, industry or pharmaceutical production. It will also hamper
the freedom of choice of European consumers (Bertheau, 2013, 2019).